Active Learning with c-Certainty
نویسندگان
چکیده
It is well known that the noise in labels deteriorates the performance of active learning. To reduce the noise, works on multiple oracles have been proposed. However, there is still no way to guarantee the label quality. In addition, most previous works assume that the noise level of oracles is evenly distributed or example-independent which may not be realistic. In this paper, we propose a novel active learning paradigm in which oracles can return both labels and confidences. Under this paradigm, we then propose a new and effective active learning strategy that can guarantee the quality of labels by querying multiple oracles. Furthermore, we remove the assumptions of the previous works mentioned above, and design a novel algorithm that is able to select the best oracles to query. Our empirical study shows that the new algorithm is robust, and it performs well with given different types of oracles. As far as we know, this is the first work that proposes this new active learning paradigm and an active learning algorithm in which label quality is guaranteed.
منابع مشابه
Relational Active Learning for Joint Collective Classification Models
In many network domains, labeled data may be costly to acquire—indicating a need for relational active learning methods. Recent work has demonstrated that relational model performance can be improved by taking network structure into account when choosing instances to label. However, in collective inference settings, both model estimation and prediction can be improved by acquiring a node’s labe...
متن کاملReducing systematic review workload through certainty-based screening
In systematic reviews, the growing number of published studies imposes a significant screening workload on reviewers. Active learning is a promising approach to reduce the workload by automating some of the screening decisions, but it has been evaluated for a limited number of disciplines. The suitability of applying active learning to complex topics in disciplines such as social science has no...
متن کاملShift of “Certainty” in Pre- and Post-Citation Arguments: The Case of Textbooks in Applied Linguistics
Writing academic texts by novice researchers requires a framework and support by learning how to cite the works of others. However, compared to the studies on other academic writings, studying citations by considering certainty markers has received little attention. The main purpose of this study was to investigate the shifts of certainty markers (hedges and boosters) in pre- and post-citation ...
متن کاملActive learning as a means to distinguish among prominent decision strategies
A long-standing debate in decision making has been whether people rely on very little information for making choices, or weigh and add all available information. We propose a new method to determine whether a non-compensatory (Take-TheBest) or compensatory strategy (Logistic Regression) is more psychologically plausible: by looking at peoples active learning queries. This method goes beyond tra...
متن کاملActive learning for spoken language understanding
In this paper, we describe active learning methods for reducing the labeling effort in a statistical call classification system. Active learning aims to minimize the number of labeled utterances by automatically selecting for labeling the utterances that are likely to be most informative. The first method, inspired by certainty-based active learning, selects the examples that the classifier is ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012